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ABSTRACT 

Nucleotide excision repair (NER) efficiencies of 
DNA lesions can vary by orders of magnitude, for 
reasons that remain unclear. An example is the 
pair of A/-(2'-deoxyguanosin-8-yl)-2-aminofluorene 
(dG-C8-AF) and /V-(2'-deoxyguanosin-8-yl)-2-acety- 
laminofluorene (dG-C8-AAF) adducts that differ by 
a single acetyl group. The NER efficiencies in 
human HeLa cell extracts of these lesions are sig- 
nificantly different when placed at G 1; G 2 or G 3 in the 
duplex sequence (5 -CTCGTG2CG3CCATC-3) con- 
taining the Nar\ mutational hot spot. Furthermore, 
the dG-C8-AAF adduct is a better substrate of NER 
than dG-C8-AF in all three Nar\ sequence contexts. 
The conformations of each of these adducts were 
investigated by Molecular dynamics (MD) simulation 
methods. In the base-displaced conformational 
family, the greater repair susceptibility of dG-C8- 
AAF in all sequences stems from steric hindrance 
effects of the acetyl group which significantly 
diminish the adduct-base stabilizing van der Waals 
stacking interactions relative to the dG-C8-AF case. 
Base sequence context effects for each adduct 
are caused by differences in helix untwisting and 
minor groove opening that are derived from the dif- 
ferences in stacking patterns. Overall, the greater 
NER efficiencies are correlated with greater 
extents of base sequence-dependent local untwist- 
ing and minor groove opening together with weaker 
stacking interactions. 



INTRODUCTION 

Pre-mutagenic bulky DNA lesions are subject to nucleo- 
tide excision repair (NER) that restores the integrity of 
genomic DNA through a 'cut and patch' mechanism 
(1,2). In human global genomic repair of bulky DNA 
lesions (3,4), the DNA damage is recognized by the 
NER factor XPC/Rad23B (5); subsequently, additional 
factors are recruited to the site of the damage, which ul- 
timately leads to the excision of a 24-32 nt long lesion- 
containing oligonucleotide (6-9), followed by gap-filling 
DNA synthesis catalyzed by several DNA polymerases 
(10,11). However, the excision efficiencies of chemically 
different lesions by the mammalian NER apparatus vary 
by several orders of magnitude (1,12-14). The mechanistic 
origin of this phenomenon is of considerable interest par- 
ticularly since lesions that escape repair can survive until 
replication and cause mutations which initiate cancer 
(15-17). A crystal structure of yeast Saccharomyces 
cerevisiae Rad4/Rad23 (18), an ortholog of the mammalian 
XPC/Rad23B, in complex with a DNA duplex containing a 
cyclobutane pyrimidine dimer (CPD), reveals that a 
p-hairpin is inserted into the DNA helix, separating the 
damaged and partner strand, and the two bases opposite 
the disordered CPD are flipped out of the helix and are 
bound to the protein. This structure supports the hypoth- 
esis that sufficient local thermodynamic destabilization 
plays an important role in determining lesion recognition 
and subsequent excision efficiency (12,19-21). 

The carcinogens 2-aminofluorene (AF) and 2-acetyla- 
minofluorene (AAF) have been studied for many 
decades as prototypes of the aromatic amine/amide 
family (22,23). The major reaction products of metabolic- 
ally activated AAF and AF with guanine in DNA 
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include the yV-(2'-deoxyguanosin-8-yl)-2-acetylamino- 
fluorene (dG-C8-AAF) and 7V-(2'-deoxyguanosin-8-yl)- 
2-aminofluorene (dG-C8-AF) adducts (Figure 1A) (24). 
The mutagenicity and tumorigenicity of AF and AAF 
have been studied extensively (22,23) and the susceptibility 
of their adducts to human (25-27) and prokaryotic NER 
(25,28-32) have also been investigated. 

The relationships between the conformations of the 
dG-C8-AAF and dG-C8-AF adducts and their impacts 
on repair and translesion synthesis are of considerable 
interest. The structures of the dG-C8-AF and 
dG-C8-AAF adducts have been under investigation for 
many years (29,32^14). For the case of the AF adduct, 
two conformational themes have been defined by NMR 
methods in B-DNA with normal Watson-Crick base 
pairing (32,36-42). In one conformational family, the 
fluorenyl ring system is situated at the exterior of the 
B-DNA double helix with all Watson-Crick base pairs 
intact, and the damaged guanine remains in the normal 
B-DNA anti glycosidic bond conformation (36,38,40,45). 
A second conformational theme is base-displaced intercal- 
ation; in this case, the damaged guanine adopts the syn 
glycosidic bond conformation, the fluorenyl rings are 
inserted into the helix and both the adducted guanine 
and its partner cytosine are extruded into the major 
groove (41,45). These two conformational families are in 
equilibrium in normally paired DNA duplexes; the 
adjacent and even next-nearest neighbor base sequence 
contexts determine the population balance (37,38). 
Additionally, there is NMR evidence for conformational 
interchange in the orientation of the fluorenyl ring system 
showing that it may rotate ~180° about its long axis 
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(40,41) (rotation about |3'; Figure 1A). A third conform- 
ational family was also determined from high resolution 
NMR studies in which the adducted guanine is syn and the 
fluorenyl ring system is wedged into the B-DNA minor 
groove for the case of G-A (46,47) and G-I (48) 
mismatch structures. These structural families have been 
extensively characterized by CD, fluorescence, F 19 NMR 
and differential scanning calorimetry (29,33). NMR data 
have also provided a model for the dG-C8-AAF adduct in 
the single CG*C sequence context; this work showed a 
base-displaced intercalated conformation for ~70% of 
the population, but the other conformations remained 
structurally undefined (35). Recently, the Cho laboratory 
has revisited the complex conformational properties of the 
dG-C8-AAF adduct, and the mix of conformations and its 
sequence dependence were determined in detail using F 19 
NMR methods. These studies have demonstrated that the 
dG-C8-AAF adduct can assume three different types of 
conformations, namely, major groove, base-displaced 
intercalated and minor groove Wedge, and that the frac- 
tional populations of each state are strongly 
sequence-dependent (32,34). The existence of both the 
Wedge and the major groove conformers had been pre- 
dicted previously on the basis of computational analysis of 
these structures (49,50). 

The sequence S'-CTCGjG^CG^CCATC-S', which 
includes the Narl restriction site in the center (underlined), 
contains an Escherichia coli -2 deletion mutational hot 
spot at G3 (51-53) that is manifested in translesion syn- 
thesis of dG-C8-AAF adducts (51). The -2 deletion 
activity is dependent on the activity of DNA Pol II (53) 
and a recent crystal structure of DNA Pol II provides a 
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b 5' - CTCG 1 G 2 CG 3 CCATC-3' 
3' - G A G C C GC GGTAG-5' 

Figure 1. Chemical structures of the adducts and 12-mer sequence of lesion containing duplexes used in the experiments. (A) Chemical structures of 
dG-C8-AAF and dG-C8-AF. Torsion angles at the linkage site are defined as follows: a', N9 (G*)-C8 (G*)-N-C2; p', C8 (G*)-N-C2-C1; /, C8 
(G*)-N-C-C (methyl); glycosidic torsion of damaged guanine, /, 04'-Cl'-N9-C4. cis (y' = 0°) and trans (Y = 180°) rotamers of the acetyl group 
are shown in the circles. (B) A'arl-hotspot-containing 12-mer duplex used in the NER experiments. Gj, G 2 and G 3 in the duplex denote the guanine 
that is damaged by dG-C8-AAF or dG-C8-AF. Simulated 9-mer sequences are 5'-ACTCG*GCGC-3' for the G,*, 5'-CTCGG*CGCC-3' for the G 2 * 
and 5'-CGGCG*CCAT-3' for the G 3 * duplex (Supplementary Table SI), so that G*, the damaged guanine, is at the center of the simulated 9-mer in 
each case. 
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structural explanation (54). Furthermore, human cell free 
extracts were also recently shown to produce -2 deletions 
induced by the dG-C8-AAF adduct at G 3 in the Narl se- 
quence and these were partially dependent on pol rj (55). 
The solution structures of both dG-C8-AF and dG-C8- 
AAF adducts have also been investigated by NMR 
methods with modification at positions Gi, G 2 and G 3 
of the Narl sequence (Figure IB) and the relative propor- 
tions of the three types of adduct conformations described 
above have been evaluated at each guanine (32,34). Hence, 
this is an excellent model system for investigating the 
effects of adduct structure, conformation and base se- 
quence context on NER susceptibility. 

In this study we determined the relative excision 
efficiencies of both the dG-C8-AF and the dG-C8-AAF 
adducts in the Narl sequence context with the adducts 
positioned at either Gi, G 2 or G 3 (Figure IB) in human 
HeLa cell extracts. Molecular dynamics (MD) simula- 
tions, based on known NMR solution structures 
(35,40,41,49,50), were utilized to analyze the structural 
and dynamic features of each of these adducts in the 
three Narl sequence contexts. The objectives of this 
study were to elucidate how these lesions perturb the 
local double-helical DNA structure and to determine 
whether the structural perturbations can be correlated 
with observed differences in NER efficiencies. Our 
analyses provide a structural explanation for the experi- 
mentally observed differences in NER efficiencies between 
the dG-C8-AAF and dG-C8-AF adducts, and show how 
the NER dual incision efficiencies of each lesion depend 
on the identities of adjacent base pairs at the Gi, G 2 , and 
G 3 positions in the Narl sequence. 

MATERIALS AND METHODS 

Synthesis and separation of oligonucleotides with Narl- 
d*AAF, AarI-G 2 *AAF and A«rI-G 3 *AAF adducts 

The 5'-CTCGGCGCCATC-3' 12-mer sequence was 
reacted with excess of 7V-acetoxy-2-acetylaminofluorene 
(AAAF) in 2mM sodium citrate buffer as described 
(56). Briefly, the three mono-adducted [AAF]-modified 
oligonucleotides were isolated from the reaction mixture 
using reversed phase HPLC methods. The first separation 
was carried out using a linear 5-40% acetonitrile (ACN)/ 
50 mM triethylammonium acetate (TEAA, pH 7.0) buffer 
solution gradient using a semi-preparative PRP-1 column 
(7.0 x 250 mm, Hamilton Company). The second and 
third separations were carried out using a linear 10-24% 
ACN/50mM TEAA (pH 7.0) buffer solution gradient. 
The position of the modified guanine base in each of the 
three mono-adducted oligonucleotides was determined by 
a sequencing method that involves the partial exonuclease 
digestion method described in detail by Kolbanovskiy 
et al. (57). Briefly, in this method, the exonuclease diges- 
tion is arrested at the site of the lesion and the masses of 
the exonuclease-resistant fragments are determined by 
MALDI TOF mass spectrometry, thus establishing the 
positions of the lesions. 

After separation and purification, the pure [AAF]- 
modified oligonucleotides were converted to the 



deacetylated AF-modified oligonucleotides by treatment 
of the solutions with 1 M NaOH containing 0.3% (v/v) 
2-mercaptoethanol for 45min at room temperature, 
followed by separation of the sequence-isomeric oligo- 
nucleotides and their purification using reversed phase 
HPLC and CI 8 ODS-Hypersil columns with a linear 
12-30% ACN/50mM TEAA (pH 7.0) buffer solution 
gradient. 

The purified oligonucleotides containing the single 
dG-C8-AF or dG-C8-AAF adduct were 32 P-5'-endlabeled 
and incorporated into 135-mer oligonucleotide duplexes 
by standard ligation methods and then added to 
NER-competent cell extracts, as described in detail by 
Kropachev et al. (58). In all cases, the lesions were pos- 
itioned at the 67th nucleotide counted from the 5' side. 
Supplementary Figure SI shows the 135-mer sequences 
and other experimentally relevant supplementary data 
are given in Supplementary Figures S2 and S3. 

Human NER assays in HeLa cell extracts 

The NER assays were conducted as described earlier by 
Kropachev et al. (58). In brief, each reaction contained 
1 pmol of P 32 -internally labeled 135-mer duplex with an 
adduct at the 67th nucleotide counted from the 5'-side in 
an aqueous mixture generated by mixing 17.5 ul of a 1 M 
KC1 solution with 20 ul of a 10 mM Tris-ATP (pH 7.9) 
and 10 ul freshly prepared cell extract solution containing 
40^15 ug of protein, and sufficient dialysis buffer (12 mM 
MgC12, 25 mM Hepes-KOH pH 7.9, 2.5 mM Dithio- 
threitol, 1 mM EDTA 10% glycerol) to produce a final 
volume of 50 ul. After appropriate incubation time and 
subsequent preparation of the samples (58), the reaction 
products were loaded onto a 12% polyacrylamide, 8M 
urea denaturing gel. The gels were dried, exposed and 
analyzed using a Storm 840 phosphoimager and 
Image-Quant software. 

MD simulations 

In order to obtain molecular explanations for the experi- 
mental findings, we carried out extensive MD simulations. 
We investigated all structural families that have been ex- 
perimentally observed by NMR methods for dG-C8-AF 
and dG-C8-AAF-modified duplexes (35,40,41) and 
studied each structural family at each of the guanines, 
G[ G 2 and G 3 (Figure IB), of the Narl sequence 
context. The lesions were embedded in duplex 9-mers 
with the damaged guanine centrally positioned in each 
case. The families for dG-C8-AF are: major groove and 
base-displaced intercalated, and for dG-C8-AAF: major 
groove, base-displaced intercalated and minor groove 
Wedge. We did not include the Wedge conformation for 
dG-C8-AF since this conformation has been observed 
only in duplexes containing mismatches opposite the 
damaged guanine (46-48). We also investigated for the 
dG-C8-AF case the rotamers about the long axis of 
the fluorenyl moiety (governed by the torsion angle P', 
Figure 1A), where such rotamers had been observed ex- 
perimentally for the base-displaced and major groove con- 
formational families (37). The initial models were 
constructed based on NMR solution structures for all 
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cases except for the dG-C8-AAF major groove and Wedge 
conformers, which were obtained by modeling (49,50); we 
added the acetyl group (cis rotamer, Figure 1A) to the 
dG-C8-AF major groove and Wedge NMR structures 
(37), and replaced the mismatched partner A to the 
modified guanine (G*) with a C for the Wedge. Figure 2 
shows the central 5-mers of these structures for the G 2 * 
duplex. Supplementary Table SI summarizes all 21 simu- 
lations for the modified duplexes. An unmodified control 
(5'-CTCGGCGCCA-3') was also studied. MD simula- 
tions of all structures with explicit water and counterions 
were carried out for 50 ns using the AMBER9 simulation 
package (59). Full details of the molecular modeling, force 
fields (Supplementary Table S2) and MD protocols are 
given in the Supplementary Materials and Methods. All 
simulations achieved good stability after 20.0 ns 
(Supplementary Figure S4). Structural ensembles from 
20.0 ns to 50.0 ns were employed for analyses, except for 
analyses for the p" torsion angle of all dG-C8-AF-modified 
duplexes, for which the first 20 ns was retained to preserve 
the initial state following equilibration to investigate the 
possibility of rotation during the simulations 
(Supplementary Figure S5). The PTRAJ module of 
AMBER 9 (59) and AMBER 10 (60), the CARNAL 
module of AMBER 7 (61) and the Curves+ software 
package (62) were employed for structure and energy 
analyses. Details of the analyses are provided in the 
Supplementary Materials and Methods. PyMOL 
(Delano Scientific, LLC) (63) was employed to make mo- 
lecular images and movies. 

RESULTS 

Nucleotide excision repair assays 

The incubation of 135-mer Gi*, G 2 * and G 3 * duplexes 
containing single dG-C8-AF and dG-C8-AAF lesions in 
HeLa cell extracts gives rise to the appearance of radio- 
actively labeled oligonucleotide dual incision fragments 
24-32 nt in length. The \0R (+)-cis-anti-B[a]P-N 2 -dG 



lesion, also embedded at the 67th nucleotide in the 
135-mer duplexes, is an excellent NER substrate (64,65) 
and was used as a positive control of NER activity 
(Figure 3A). Comparisons of the relative extents of NER 
were calculated from the integration of the radioactivity in 
the 25-32 oligonucleotide dual excision fragment region 
divided by the total radioactivity in each lane of the gel 
autoradiogram; a typical example of such a gel is shown in 
Figure 3A. The results with all seven modified oligonucleo- 
tide substrates were performed with one and the same cell 
extract. The base line radioactivity signals were subtracted 
when the NER efficiencies were low as in the case of the 
dG-C8-AF-modified G 2 * duplexes. In order to achieve sat- 
isfactory reproducibility from day-to-day, and to take into 
account the varying activities of different cell extracts 
prepared at different times, the NER efficiencies in each 
gel autoradiogram were normalized to the value obtained 
with the 107? (+) c«-a«f/-B[a]P-A^ 2 -dG-modified 135-mer 
duplex after an incubation time of 45min. These NER 
efficiencies were arbitrarily assigned a value of 100 and 
all other NER efficiencies are reported relative to this 
value. The dual incision kinetics, determined from the frac- 
tions of the 24-32 nt long dual incision fragments excised 
from the 135-mer duplexes, are depicted in Figure 3B and 
C. Each value is the average of six independent experi- 
ments relative to the dual incision efficiency obtained at 
the 45min time points with the 10R (+)-cis-anti-B[a]P- 
7Y 2 -dG-modified 135 -mer duplexes; the actual NER effi- 
ciency at this time point for this ris-lesion was 
6.2 ± 2.3% (average of six experiments). The linear time 
dependence of dual incision products suggests that the 
NER efficiency is not limited by the availability of NER 
proteins that cooperate in generating the dual incision 
efficiencies. Using similar substrates with G*CG* 
intrastrand cisplatin-derived cross-linked lesions which 
are better NER substrates (dual incision product yields 
in the range of ~10%) than the lesions of interest in this 
work, we find that the yields of incision products are also 
proportional to the damaged DNA substrate concentra- 
tion (data not shown). Overall, these observations are 
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Figure 2. Generic structural features of conformational families in dG-C8-AAF and dG-C8-AF-modified duplexes. Shown are the initial structures 
for MD simulations for dG-C8-AAF and dG-C8-AF-modified G 2 * duplexes. Central 5-mers of the structures are viewed from the major groove side. 
(A) Base-displaced, major groove and Wedge conformers of the dG-C8-AAF-modified G 2 * duplex. (B) Base-displaced and major groove conformers 
of dG-C8-AF-modified G 2 * duplex. The P'rotamer 1 for each of the base-displaced and major groove conformers is illustrated. The G* and the 
fluorenyl rings of the P' rotamer 2 are shown in the circles at the bottom. These rotamers represent the rotation along the long axis of the fluorenyl 
rings. The P' values of Rotamer 1 and Rotamer 2 are given in Supplementary Table SI. The most representative structures from the MD ensembles 
are shown in Figures 7, 9, 10, Supplementary Figures S6, S12 and S14. All structures are represented in sticks: hydrogen and phosphate oxygen 
atoms are deleted for clarity, except for the hydrogen atoms of the acetyl groups. The lesion-containing strands and the complementary strands are in 
light gray. The fluorenyl rings are red, the acetyl group of AAF is purple and the acetyl oxygen is red. The modified guanines and partner C bases are 
green, and the neighboring bases are blue. In the nomenclature of the Cho laboratory, the major groove conformation is designated 'B', the 
base-displaced intercalated conformation is designated 'S' and the Wedge conformation is designated 'W (32). 
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Figure 3. Nucleotide excision repair of dG-C8-AAF and dG-C8-AF lesions incorporated at either Gi, G2, or G 3 in the 12-mer Narl sequences 
embedded in the center of 135-mer duplexes as a function of incubation time in extracts from human HeLa cells. (A) Autoradiograph of represen- 
tative denaturing (8 M urea) 12% polyacrylamide gel. The 10R (+)-cis-anti-B[a]P-N 2 -dG adduct [(+)-cis] in the 5'-CCATCG*CTACC-3' sequence 
context was included in these experiments as a positive control (58) and as a measure of the variable NER activities of different cell extracts (see 
text). The lanes marked 'M' show the mobilities of oligonucleotide length markers. The exact sequence of the 135-mer is shown in 
Supplementary Figure SI. Time dependence of fractions of dual incision products in the 20-30 nt long oligonucleotide range as a function of 
incubation time for (B) dG-C8-AAF Gi*, G2* and G3* duplexes and (C) dG-C8-AF G|*, G 2 * and G 3 * duplexes. Inverted triangles (cyan): (+)-cis- 
adducts; squares (black): modified G[* duplexes; circles (red): modified G 2 * duplexes; triangle (blue): modified G 3 * duplexes. The individual data 
points are averages of six independent experiments normalized with respect to the control DNA substrates, expressed as a percent of the dual incision 
result obtained with the (+)-cis adducts at the 45 min time point in each experiment and assigned an arbitrary value of 100%; the actual measured 
NER efficiencies at this time point were in the range of ~ 3-9% with an average of 6.2% expressed in terms of (radioactivity in the 20-30 nt range)/ 
(total radioactivity) x 100. The straight lines represent the least-squares fits to the data points. (D) Relative rates of dual incisions estimated from the 
slopes of the straight lines in (B) and (C). In the case of the dG-C8-AF-modified G 2 * duplex substrate, only the relative rate of the slower component 
(~92-96% of the sample) is shown; a faster, minor component is incised within the first 15 min and its rate was not further characterized. 



consistent with the notion that the differences in NER 
efficiencies observed (Figure 3) are associated with a rec- 
ognition step in the multi-step NER process that is rate 
limiting (7,14,66,67). Most recently, the work of Luijs- 
terburg et al. (67) showed that lesion recognition is 
strongly rate limiting for mammalian NER in living cells 
based on extensive kinetic measurements of binding and 
dissociation of individual components of the NER machin- 
ery utilizing live cell imaging. 

When the fractions of dual incision products increase 
linearly with incubation time and the best straight line 
drawn through the data points intercepts the vertical 
axis at the zero point, either a homogeneous adduct popu- 
lation, or a conformationally heterogeneous population 
with overall indistinguishable NER efficiencies is 
indicated. However, if the best straight line intercepts 



the vertical axis with an apparent NER efficiency greater 
than zero, a conformational heterogeneity of adducts with 
different NER efficiencies is indicated. In the case of 
the dG-C8-AAF-modified G,*, G 2 *, G 3 * and the 107? 
(+)-cis-anti-B[a]P-N 2 -dG -modified duplexes, the time de- 
pendence of dual incision product formation is linear 
(Figure 3B). In the case of the dG-C8-AF-modified Gi* 
and G 3 * duplexes, the best least squares fit is also linear, 
although the straight lines calculated are close to but do 
not exactly intersect the vertical axis at the zero-point 
(Figure 3C). However, the time-dependence of product 
formation is strongly non-linear in the case of the 
dG-C8-AF G 2 * duplex; this is clearly evident from 
the non-zero vertical axis intercept of the best straight 
line drawn through the experimental data points. 
This suggests that the dG-C8-AF G 2 * duplex, while 
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structurally homogeneous, is conformationally heteroge- 
neous. The non-linearity suggests that this substrate 
contains two or more conformationally different dG-C8- 
AF G2* duplex structures, which are characterized by 
markedly different NER efficiencies. The vertical axis 
intercept of the dG-C8-AF G 2 * sample is ~5% which is 
approximately equal to the more rapidly repaired fraction, 
while the major fraction is the most slowly repaired among 
all of the lesions observable in the experiments. 

The initial rates of dual incisions are compared in the 
bar graph in Figure 3D. Overall, there is a difference by a 
factor of -13 for the best (dG-C8-AAF G 2 *) and the 
worst (dG-C8-AF G 2 *) NER substrates. The dual 
incision rates are the same for the dG-C8-AAF G 2 * and 
the \0R (+)-cis-anti-B[a]P-N 2 -dG -modified DNA 
duplexes within experimental error. The relative NER 
rates for the dG-C8-AAF Gi* and dG-C8-AAF G 3 * 
duplexes are the same within experimental error as well 
and are about 1.8 times smaller than the dG-C8-AAF G 2 * 
rates. It is noteworthy that our results for the dG-C8-AAF 
adduct with HeLa cell extracts follow the same trend as 
those obtained earlier by Mu et al. (25), with the 
dG-C8-AAF G 2 * duplex being repaired significantly 
more efficiently than the dG-C8-AAF Gi* and 
dG-C8-AAF G 3 * duplexes. Thermal melting studies 
(Supplementary Figure S2) also show that the 
dG-C8-AAF adducts destabilize double-stranded DNA 
more extensively than the dG-C8-AF adducts as has 
been observed previously (33,68). The NER rates are uni- 
formly lower for all dG-C8-AF adducts than for all three 
dG-C8-AAF adducts, and are similar in value for the 
dG-C8-AF-modified Gi* and G 3 * duplexes. The NER 
efficiencies of these duplexes are a factor of ~3.5 greater 
than the major fraction (95%) of the dG-C8-AF-modified 
G 2 * duplex, which is the least readily repaired one. 

Molecular modeling and MD simulations 

In order to provide molecular explanations for the experi- 
mental NER results, we carried out MD simulations for 
duplexes containing the dG-C8-AF and dG-C8-AAF 
adducts at the Gi, G 2 and G 3 positions of the Nar\ 
sequence (Figure IB) that were studied experimentally. 
For the dG-C8-AF adduct, we simulated the base- 
displaced intercalated and major groove conformers, 
including rotamers of (3' (Figure 2) that were experimen- 
tally observed (37). For the dG-C8-AAF adduct, we 
simulated these two conformations and also the minor 
groove Wedge conformer, as these are the conformers 
observed for each adduct by F 19 NMR in normally 
Watson-Crick base paired duplexes (32,34,35,37,38). We 
also simulated an unmodified control duplex. Supple- 
mentary Table SI summarizes our 21 simulations. We 
carried out detailed analyses of the MD trajectories 
between 20 and 50 ns, during which the simulations were 
stable, as shown by the RMSD data of each 50 ns MD 
simulation (Supplementary Figure S4). 

Earlier results have shown that bulky polycyclic 
aromatic B[a]P-derived DNA lesions with base-displaced 
intercalated conformations (e.g. the \0R ( + )-cis-anti- 
B[a]P-7Y 2 -dG lesion employed as a control in the above 



experiments) are better substrates of NER than those pos- 
itioned in the minor groove (26) or those with classical 
intercalative conformations without base displacement 
(69,70). Other examples of base-displaced intercalated 
adducts that are also excellent substrates of NER are the 
dG-C8-AAF (12,25), the 7V-(deoxyguanosin-8-yl)- 
2-amino-l-methyl-6-phenylimidazo[4,5-A]pyridine (dG- 
C8-PhIP), (71) and the 7V-(deoxyguanosin-8-yl)- 
1-aminopyrene (dG-C8-AP) adducts (72). In order to 
further establish the validity of this paradigm, we 
analyzed the structural and energetic features of all the 
conformational families of the dG-C8-AF and 
dG-C8-AAF adducts in all three Narl sequence contexts 
(base-displaced and major groove for dG-C8-AF, and 
base-displaced, major groove and minor groove Wedge 
for dG-C8-AAF). We found that the base-displaced con- 
formers induce the most prominent structural and ener- 
getic distortions in the modified DNA duplexes, namely 
ruptured Watson-Crick hydrogen bonding, stacking de- 
stabilization, untwisting and minor groove enlargement 
which are all hallmarks of DNA lesions that are good 
substrates of NER (21,69). 

The following sections describe the specific structural 
distortions of the different conformational families 
depicted in Figure 2. The results reveal that the 
base-displaced intercalated DNA lesions are the most ex- 
tensively distorted, and that the distortions correlate with 
relative repair susceptibilities of dG-C8-AAF and 
dG-C8-AF and their respective sequence context depend- 
ence. By contrast, the major groove and Wedge conform- 
ations are shown to be much less distorted. 

The major groove and Wedge conformational families are 
little distorted compared with the unmodified control 

As shown in Figure 4, for both dG-C8-AAF and 
dG-C8-AF-modified duplexes, the major groove confor- 
mers retain normal B-DNA Watson-Crick base pairing 
and strong van der Waals stacking interactions that are 
comparable to the unmodified control. Furthermore, for 
dG-C8-AAF-modified duplexes, the Wedge conformers 
also have close to normal stacking (Figure 4), and one 
strong non-Watson-Crick hydrogen bond between the 
G* and partner C, as well as a second weaker one in 
two of the three sequences (Supplementary Figures S6, 
S7 and Supplementary Table S3). Furthermore, as 
shown in Figure 5, the helical twist and minor groove di- 
mensions of these major groove and Wedge conformers 
(Supplementary Figures S8, S9 and Supplementary Tables 
S4, S5) are much closer to normal than for the 
base-displaced ones as discussed below. 

Higher repair susceptibility of dG-C8-AAF than 
dG-C8-AF adducts in their base-displaced 
intercalated conformations is correlated with weaker 
stacking interactions accompanied by more untwisting 
and minor groove enlargement 

Significantly weaker stacking interactions in the 
base-displaced dG-C8-AAF duplexes 
The importance of perturbations of the normal DNA base 
stacking interactions by DNA lesions, such as one 
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Figure 4. Stabilizing local van der Waals stacking interaction energies 
of the major groove and Wedge conformers are comparable to the 
unmodified control. For the major groove conformers, the van der 
Waals interaction energy between the lesion base pair and its neighbor- 
ing base pairs are computed. For the Wedge conformers, the stacking 
interaction energy also includes the interaction between the fluorenyl 
rings and the DNA backbone residues in contact with it in the minor 
groove (detailed in Supplementary Materials and Methods). For the 
unmodified control, the mean trimer value over all three sequences is 
shown for clarity, since the individual values of the three sequences are 
very similar (Supplementary Table S5). The mean values and standard 
deviations of the block averages (Supplementary Materials and 
Methods) for the stacking interaction energies of the major groove 
and Wedge conformers are given in Supplementary Table S4. 



containing a crosslink between guanine and thymine in 
DNA duplexes (73), as well as the potential stabilizing 
effects associated with interactions between aromatic 
rings of carcinogens with DNA bases has been considered 
recently (69,71,74); evidence was presented that weakened 
van der Waals stacking interactions correlate with 
enhanced NER susceptibility whereas enhanced stacking 
interactions result in diminished excision efficiencies. 
Accordingly, we evaluated the van der Waals stacking 
interaction energies for all simulated base-displaced struc- 
tures. These energies are 10-13 kcal/mol less stabilizing for 
the base-displaced dG-C8-AAF duplexes than for the 
dG-C8-AF duplexes (Figure 6 and Supplementary 
Table S6). 

The origin of the large difference in stacking energies 
between the dG-C8-AF and dG-C8-AAF adducts is due 
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Figure 5. The structural features of twist and minor groove width in 
the Major groove and Wedge conformers of the modified duplexes are 
close to the normal values of the unmodified duplex. (A) Definition of 
helical untwisting in modified duplexes. The twist angles of the 
modified duplexes [Twist (mod)] are between the two base pairs sur- 
rounding the G*, and represent the local impact on helix twisting 
imposed by the adduct. The twist angle of the unmodified control 
[Twist (unmod)] is the average of the two-step twist angles of the 
CGG, GGC and CGC trimers, but all are very similar 
(Supplementary Table S5). The unmodified and modified trimers are 
rendered as in Figure 2. Helical axes are shown as black dashed lines, 
and the base pair planes as transparent rectangular plates. 
(B) Untwisting angle values of the major groove and Wedge confor- 
mers. The untwisting angles are computed as in (A). The mean values 
and standard deviations of the block averages (Supplementary 
Materials and Methods) for the untwisting angles are shown. (C) 
Differences between minor groove widths of modified duplexes and 
unmodified control for major groove and Wedge conformers. Minor 
groove difference = Minor groove width modified — Minor groove width 
unmodified- For the modified duplexes the minor groove width is 
measured at the lesion site between P7 — PI 6 and for the unmodified 
control the minor groove width is measured at analogous positions 
(Supplementary Figure S9). The mean values and standard deviations 
of the block averages (Supplementary Materials and Methods) for the 
minor groove differences are shown. 



to the acetyl group in the latter adduct. Although the 
fluorenyl rings intercalate into the helix and stack with 
neighboring base pairs in both the dG-C8-AAF and 
dG-C8-AF base-displaced duplexes, the modified 
guanines are positioned differently. In the dG-C8-AAF 
case, the modified guanine G* is displaced into the 
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Figure 6. Stacking stabilization is much stronger for dG-C8-AF than 
dG-C8-AAF in the base-displaced duplexes. Shown are the population 
distributions with mean values of the van der Waals stacking inter- 
action energies between lesion and adjacent bases in base-displaced 
dG-C8-AAF and dG-C8-AF duplexes. For the dG-C8-AF duplexes, 
the lesion guanine, G*, is included (Supplementary Materials and 
Methods) to account for its stacking with adjacent bases. These 
results are for the (3' rotamer 1; similar results were obtained for P' 
rotamer 2 (Figure 2B and Supplementary Table S6). 



major groove (Figure 7A) and is aligned nearly perpen- 
dicularly to the fluorenyl rings. This alignment is dictated 
by steric restraints posed by the acetyl group and facili- 
tates G* forming two hydrogen bonds with the backbone 
(Figure 7A, Supplementary Table S3). In this position, the 
G* is in close steric contact with its own deoxyribose 
residue whose orientation therefore becomes distorted so 
that it is perpendicular to the helix axis rather than in the 
normal parallel position (Figure 7A and Supplementary 
Movie SI) and its sugar pucker oscillates between different 
conformations (Supplementary Figure SI OA). However, 
in the dG-C8-AF case, without the acetyl group, the 
displaced G* is closer to being co-planar with the fluorenyl 
rings and is partly stacked with its 5' neighboring base on 
the same strand (Figure 7B and Supplementary Movie 
S2); its sugar residue is oriented normally with stable 
puckering (Supplementary Figure SI OB). Thus, the 
modified guanine base G* provides additional stacking 
stabilization and, being unrestrained, also allows the 
fluorenyl rings to reach further across to the complemen- 
tary strand, which also improves the stacking interactions. 
The G*/fluorenyl ring orientational difference in dG-C8-AAF 




Partner C 



Partner C 



Partner C 



Figure 7. The acetyl group imposes a different base-displaced 
intercalated conformation in dG-C8-AAF than in dG-C8-AF. Note 
the different positions of G*. For illustration the G 3 * duplexes are 
presented. (A) Most representative structures of dG-C8-AAF-modified 
G 3 * duplex with acetyl group in cis and trans conformation (Figure 1). 
G 3 * was selected for this illustration because both cis and trans 
rotamers of the acetyl group were observed in the simulations (see 
Supplementary Results). Stereo views are available in Supplementary 
Figure S14. See also Supplementary Movies SI and S3. (B). Most rep- 
resentative structure of dG-C8-AF-modified G 3 * duplex. For both 
adducts, the Gi* and G 2 * duplexes have overall similar intercalation 
structures as in the G 3 * duplex. See also Supplementary Movie S2. The 
central trimers are viewed from the major groove side and represented 
as in Figure 2, except that the phosphate oxygen that forms hydrogen 
bonds with G* is shown in red stick. The hydrogen bonds between G* 
and backbone phosphate oxygen are represented using black dashed 
lines. The acetyl group is shown in transparent CPK (purple and red). 



and dG-C8-AF is shown in crystal structures containing 
G-AAF (75,76) and base-displaced dG-C8-AF (77), as well 
as in the high resolution NMR structures of dG-C8-AF 
adducts in double-stranded DNA (37). 

Greater untwisting and minor groove enlargement in 
base-displaced dG-C8-AAF duplexes 

Untwisting and minor groove enlargement may contribute 
to distortions and destabilizations that can facilitate NER 
(21). Untwisting at the lesion site is reflected in a decrease 
of the twist angle, relative to the unmodified control, 
between the 5' and 3' base pairs that are adjacent to the 
lesion (Figure 5A). The smaller the twist angle, the more 
untwisted the duplex. We note that intercalation is always 
associated with helix untwisting (78,79), the extent of 
which depends on the structure of the intercalated 
moiety (69). For the dG-C8-AAF and dG-C8-AF 
adducts, the backbone untwists so that the base pairs 
adjacent to the lesion are optimally stacked with the 
elongated fluorenyl moiety. However, the untwisting in 
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the dG-C8-AAF modified duplexes is always greater than 
for the dG-C8-AF adducts (Figure 8A and B) (the differ- 
ence is sequence dependent, as discussed further below). 
This greater untwisting relative to the unmodified duplex 
in the dG-C8-AAF case results from the steric restraints 
imposed by the acetyl group (Figure 7A and 
Supplementary Movie SI), which impedes the free adjust- 
ment of the backbone twist. In each case, the untwisting is 
accompanied by minor groove enlargement, which is also 
greater for dG-C8-AAF in all sequences (Figure 8C). The 
correlation of these properties is shown in Supplementary 
Figure SI 1. 

Sequence-dependence of stacking patterns in 
base-displaced intercalated conformations produce 
differences in untwisting and minor groove opening 
that correlate with the base sequence dependence of 
NER dual incision efficiencies 



The G2* sequence is the best repaired for dG-C8-AAF 

The experimental results showed that G 2 * is incised 
roughly twice as efficiently as Gi* or G 3 * (Figure 3D). 
All show homogenous initial repair rates (Figure 3B). In 
order to gain insights into this sequence effect, we 
analyzed the sequence dependence in structural properties 
of the dG-C8-AAF duplexes manifested in our dynamics 
simulations. The governing feature is the sequence de- 
pendence of stacking patterns between the fluorenyl ring 
system and adjacent base pairs. 

In the dG-C8-AAF adducts, stacking is restricted by 
steric effects associated with the acetyl group, which 
prevents stacking interactions between the modified 
guanine G* and adjacent bases and inhibits the ability 
of the fluorenyl ring system to move closer to the 
opposite strand (Figure 7A). In the Gi* duplex, the 
optimal stacking pattern involves the stacking of 
the fluorenyl rings with the cytosine on the 5' side of 
G* and the partner strand cytosine on the 3' side 
(Figure 9 A and Supplementary Figure S12A). In the 
G 2 * duplex, optimal interactions are achieved by 
the stacking of the fluorenyl rings with the guanine on 
its 5' side and the partner strand guanine on its 3' side 
(Figure 9B and Supplementary Figure S12B). In the G 3 * 
case, optimal stacking involves overlap of the fluorenyl 
rings with the cytosine on the 5' side of G* and the six 
membered ring of the partner strand guanine on the 3' side 
(Figure 9C and Supplementary Figure S12C). The 
cisjtrans conformational possibilities of the acetyl group 
(Figures 1A and 7A) play a role in modulating optimal 
stacking in each of the three Nar\ sequence-containing 
duplexes. Details are given in Supplementary Figures 
S13-S14 and Movie S3. 

Achieving the optimal stacking interactions in each 
sequence requires different adjustments in untwisting as 
well as different minor groove enlargements. The G 2 * 
duplex is characterized by the greatest untwisting (i.e. it 
exhibits the lowest twist angle between the base pairs sur- 
rounding the intercalated fluorenyl rings) (Figures 8B 
and 9B), whereas the degree of untwisting is significantly 
lower as well as similar in the Gi* and G 3 * duplexes 
(Figures 8B, 9A and C). Compared with the analogous 
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Figure 8. The extent of untwisting and minor groove enlargement for 
the base-displaced conformations is greatest for dG-C8-AAF at G 2 * 
and least for the proposed slowly repaired population of dG-C8-AF 
at G2*. (A) Population distributions of twist angles for the 
base-displaced conformers and that of the unmodified control. (B) 
Untwisting angles for the base-displaced conformers. The untwisting 
angle is calculated as in Figure 5B. The mean values and standard 
deviations of the block averages (Supplementary Materials and 
Methods) for the untwisting angles are shown. (C) Minor groove en- 
largement of the base-displaced conformers. Minor groove 
enlargement = Minor groove width mo< jified — Minor groove width un _ 
modified- The minor groove widths are measured as in Figure 5C. The 
mean values and standard deviations of the block averages 
(Supplementary Materials and Methods) for the minor groove differ- 
ences are shown. For dG-C8-AF at G 2 *, standard deviations are for all 
structures of the less untwisted population (Supplementary Figure S16). 



twist angle of the unmodified control (Figure 9D), all 
three sequences are significantly untwisted. In each case, 
untwisting is accompanied by concomitant minor groove 
enlargement, the extent of which follows the order of un- 
twisting angles (Figure 8B and C; Supplementary Table 
S6). The differences in these distorting features are 
correlated with the sequence dependence of the relative 
NER susceptibilities, with the most untwisting and 
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Figure 9. Views along the helix axis reveal sequence-dependent differences in stacking patterns of the base-displaced dG-C8-AAF-modified duplexes. 
The stacking pattern differences determine the extent of untwisting. Central trimers of the most representative structures are shown. 
(A) dG-C8-AAF-modified G,* duplex. (B) dG-C8-AAF-modified G 2 * duplex. (C) dG-C8-AAF-modified G 3 * duplex. (D) Twist angle of the un- 
modified control in the representative 5'-GGC-3' trimer. Other trimer sequences are similar in twist angle (Supplementary Table S5). The twist angle 
between the 5' and 3' base pairs is shown in orange shade with the mean value of the twist angle for each population labeled. The greater the twist 
angle, the less the untwisting relative to the unmodified duplex. The modified structures are rendered as in Figure 7A without CPK. The unmodified 
control is rendered as in Figure 2. Stereo views are shown in Supplementary Figure S12. 



greatest minor groove opening occurring at the most 
repaired dG-C8-AAF lesion, at G 2 * (Figure 3D). In 
accord with the homogenous repair rates, the population 
distributions of twist angles manifest homogenous popu- 
lations for all three duplexes (Figure 8A). 

The dG-C8-AF adduct at G2* exhibits a short rapidly 
repaired phase and a predominant slowly repaired phase 

The experimental NER results reveal that heterogeneity is 
manifested to a significant extent only in the case of the 
dG-C8-AF adduct at G 2 * (Figure 3C). Moreover, the dual 
incision efficiency of G 2 * in its dominant phase is about 
one-third that of Gi* and G 3 *, which are repaired with 
approximately equal efficiencies (Figure 3D). 
Interestingly, the two repair phases are consistent with a 
heterogeneous population distribution of twist angles 
between the base pairs 5' and 3' to the lesion, observed 
only in the case of the dG-C8-AF adduct at G 2 *, but not 
at Gj* and G 3 * (Figure 8A). 

With the same dG-C8-AF adduct positioned at Gi* and 
G3*, the distributions for the twist angles revealed a 
homogenous population for each case (Figure 8A). The 



most representative adduct structures for the Gj* and G 3 * 
duplexes are shown in Figure 10A and B. In the 
dG-C8-AF-modified Gi* duplex, the base 5' to the G* is 
a C; this C stacks very weakly with the modified guanine; 
hence the fluorenyl ring system is free to adjust its position 
to maximize stacking interactions with the partner strand 
bases, allowing strong overlap between the fluorenyl rings 
and the 3'-C and 5'-G partner strand bases (Figure 10A). 
This stacking pattern results in significant untwisting of 
the Gi* duplex (Figure 8B), accompanied by minor 
groove opening (Figure 8C). In the case of the G 3 * 
duplex, the G* stacks only partially with its 5' side C on 
the same strand, and the fluorenyl rings can thus optimize 
stacking interactions with the partner strand bases: the 
fluorenyl moiety achieves extensive stacking with the 
5' G, as well as a perfect one ring overlap with the 3' G 
in the partner strand (Figure 10B). This stacking pattern 
produces untwisting and minor groove opening of the 
dG-C8-AF-modified G 3 * duplex quite close to that of 
the analogous Gi* duplex (Figure 8B and C). 

The two distinct twist angle populations observed only 
in the case of the dG-C8-G 2 * duplex are characterized by 
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Figure 10. Views along the helix axis reveal sequence-dependent differences in stacking patterns of the base-displaced dG-C8-AF-modified duplexes. 
The stacking pattern differences determine the extent of untwisting. Central trimers of the most representative structures are shown. (A) 
dG-C8-AF-modified G,* duplex. (B) dG-C8-AF-modified G 3 * duplex. (C) The more untwisted population of dG-C8-AF-modified G 2 * duplex. 
(D) The less untwisted population of dG-C8-AF-modified G 2 * duplex. The structures are rendered as in Figure 2. The twist angles between the 5' and 
3'base pairs are represented as in Figure 9. 



a more untwisted population with a mean twist angle 
value of ~45° and a second much less untwisted one 
with a mean twist angle value of ~60° (Figure IOC 
and D; Supplementary Figure SI 6). These two popula- 
tions display different extents of protrusion of the 
fluorenyl rings into the minor groove. Figure 10C and D 
shows representative structures of each of these twist 
populations, illustrating the different positions of the 
fluorenyl rings. In the population with twist angle of 
~60°, the modified guanine and its 5'-G are well 
stacked, at the expense of poorer stacking by the fluorenyl 
rings, which extend further into the minor groove (Figure 
10D). The population with twist angle of ~45° has weaker 
stacking between the modified guanine and its 5'-G and 
well-stacked fluorenyl rings (Figure 10C); it resembles the 
structures of the Gi* and G 3 * duplexes in Figure 10A and 
B. However, the ~60° twist angle population is only 
modestly untwisted (~5°, Figure 8B) with the lowest 
minor groove enlargement (Figure 8C). We hypothesize 
that this modestly untwisted base-displaced conformer is 
the slowly repaired one for the dG-C8-AF-modified G2* 
duplex; the rapid repair phase can then be attributed to 
the more untwisted population which resembles the Gi* 
and G3* duplexes (Figure 10). Figures 3D, 8B and C show 



how the greater NER susceptibilities correlate with greater 
untwisting angles and minor groove enlargements. The 
extent of untwisting also impacts the position of the 
displaced partner C: the more untwisted the structure 
the greater the displacement. Details are given in 
Supplementary Figure SI 7 and Supplementary Table S7. 

Populations in the MD simulations with extrusion of the 
fluorenyl rings into the minor groove suggest a pathway 
for rotation about the fluorenyl long axis in the 
base-displaced intercalated dG-C8-AF duplex 

NMR studies have shown two populations of rotamers 
representing a flip about the fluorenyl long axis and 
governed by the (3' torsion angle for base-displaced 
dG-C8-AF adducts (37). Our MD simulations for the 
G 2 * duplex have suggested a possible pathway for this 
reorientation involving fluorenyl ring protrusion into the 
minor groove, detailed in Supplementary Data (Supple- 
mentary Figures SI 6, SI 8 and Supplementary Movie S4). 

DISCUSSION 

Our analyses of the base-displaced intercalated and major 
groove conformations of the dG-C8-AF adducts, and the 
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base-displaced intercalated, major groove and minor 
groove Wedge conformations for the dG-C8-AAF 
adducts (Figure 2) suggest that extensive distortions and 
destabilizations in the base-displaced intercalated adducts 
are correlated with the relative NER susceptibilities of 
dG-C8-AAF and dG-C8-AF and their sequence depend- 
ence. The major groove and Wedge conformations are less 
distorted and thus are suggested to be less susceptible to 
NER (Figures 5 and 8). These findings support our hy- 
pothesis that the base-displaced intercalated conformation 
is the more repair susceptible one. Our experiments with 
both dG-C8-AF and dG-C8-AAF adducts do not allow 
the general interpretation that the solution population of 
the base-displaced conformation (Supplementary Table 
S8) correlates with the relative repair susceptibility in 
our human NER assay. This is reasonable since solution 
population balances determined by NMR are temperature 
sensitive in the 1-25°C range (33,41), may differ in the 
presence of the repair machinery, and the machinery can 
respond differently to the same adduct in prokaryotic and 
eukaryotic systems (25,28,32), discussed in detail in Liu 
et al. (80). 

A notable feature of the NER experiments is the much 
greater excision efficiency of the dG-C8-AAF compared 
with the dG-C8-AF adduct, irrespective of sequence 
context. This has been observed previously in prokaryotic 
systems (28,29,31-33,77). In the case of eukaryotic NER, 
there is only one early report describing the relative NER 
efficiencies of dG-C8-AAF but not dG-C8-AF adducts in 
the Narl sequence, and the sequence dependence was not 
explained (25). Our relative NER efficiencies of 
dG-C8-AAF adducts in the Narl sequence context are in 
good agreement with those of Mu et al. (25). In addition, 
the Scharer laboratory (Yeo, J.E., Khoo, A., Fagbemi, 
A.F., Scharer, O.D., submitted for publication) have 
recently investigated the NER efficiencies of 
dG-C8-AAF and dG-C8-AF lesions in the Narl 
sequence context embedded in a plasmid, albeit with dif- 
ferent sequences flanking the Narl sequence. The focus of 
the Yeo et al. study was on correlating the NER 
efficiencies with the thermodynamic destabilizations 
induced by the lesions and the binding affinities of XPC/ 
Rad23B to the modified sequences. The overall NER 
efficiencies and general conclusions were similar to those 
reported here; a single difference (relative NER efficiencies 
exhibited in the case of dG-C8-AAF at G 2 * and G 3 *) may 
be attributed to differences in the use of linear versus 
circular DNA substrates or, perhaps, to the differences 
in the Narl flanking sequences. Additional details are 
given in Supplementary Figure S3. Interestingly, the 
sequence-dependent mutagenicity of dG-C8-AF and 
dG-C8-AAF in mammalian Chinese Hamster Ovary 
cells at Gi, G 2 and G 3 of the Narl sequence (81) is in- 
versely correlated with the NER efficiencies observed here; 
this suggests that the structures determine both the repair 
susceptibility and the mutagenicity of the lesions, consist- 
ent with the concept that lesions that escape repair can 
survive to replication and cause mutations (15-17). 

Our analyses showed that both the dG-C8-AAF and 
dG-C8-AF adducts in their base-displaced conformations 
exhibit ruptured Watson-Crick base pairing, local 



untwisting and minor groove opening. However, the 
dG-C8-AAF adduct has weaker stacking interactions 
than the dG-C8-AF adduct; the weaker stacking is 
accompanied by greater untwisting and minor groove 
opening, which stems from the steric effect of the acetyl 
group. The AAF acetyl group restrains the modified 
guanine base G* in the major groove and only the 
fluorenyl aromatic ring system is intercalated (Figure 7A 
and Supplementary Movie SI). However, in the case of the 
dG-C8-AF adduct, the fluorenyl rings are intercalated and 
the modified guanine can partially stack with an adjacent 
5' base (Figure 7B and Supplementary Movie S2). 
Crystallographic and NMR data support these structural 
differences (37,75-77). The greater NER excision effi- 
ciency of dG-C8-AAF relative to the dG-C8-AF adduct 
is explained by the lower van der Waals stacking stabil- 
ization provided by the intercalating lesion, greater un- 
twisting and greater minor groove opening. 

The observed sequence dependence of NER efficiencies 
results from sequence-governed differences in stacking 
patterns which produce differences in the extents of un- 
twisting and minor groove opening. The greatest repair 
susceptibility observed with the dG-C8-AAF adduct is at 
G 2 *; in this case, the fluorenyl rings are stacked with 
neighboring guanine bases on the 5' side of G 2 * on its 
own strand and on the 3' side of G 2 * on the complemen- 
tary strand (Figure 9B). This particular fluorenyl ring/base 
stacking pattern produces the most untwisted adduct con- 
formation of all three dG-C8-AAF adduct-containing 
duplexes, with the greatest minor groove opening; these 
structural perturbations are associated with the greatest 
NER efficiency among all of the six adducts investigated 
here (Figure 3). In the case of dG-C8-AF, the lowest 
repair susceptibility for the dominant slowly repaired 
phase of the three modified duplexes is observed with 
the modification at G 2 *. Without the acetyl group, the 
modified guanine can stack well with the guanine on its 
5' side, causing the fluorenyl aromatic rings to protrude 
into the minor groove in the base-displaced conformer 
proposed to represent the slowly repaired phase 
(Figure 10D); this conformer has the least untwisting 
and minor groove opening of all six duplexes (Figure 8B 
and C). Thus, these weakest structural perturbations are 
correlated with the lowest efficiency of NER observed 
among the six duplexes studied (Figure 3). Furthermore, 
the heterogeneity in repair rates and twist angle popula- 
tions observed only for this modified duplex among the six 
investigated results from a combination of effects: the 
absence of the acetyl group's steric restraint and the 
presence of the 5' G base only in the G 2 * sequence; 
together they permit stabilization of the less untwisted 
population only for the dG-C8-AF-modified G 2 * duplex 
(Figure 10). We note that severe untwisting with minor 
groove enlargement is a structural feature of the 
damaged duplex in the Rad4/Rad23 crystal structure 
(18), suggesting that these distortions could aid in 
facilitating lesion recognition. 

Finally, it is worth mentioning that simulations, such as 
in the current NMR-based work, have the advantage of 
providing molecular structures of individual conformers, 
where solution populations are heterogeneous; in this 
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case, biophysical data such as thermal melting cannot 
deconvolute the contributions of the individual con- 
formational states. Hence, the simulations can provide 
molecular insights on both biophysical and biochemical 
experiments that are otherwise elusive. 



CONCLUSION 

The present experimental and computational studies with 
dG-C8-AF and dG-C8-AAF adducts in the Narl sequence 
context support the hypothesis that sufficient local 
thermodynamic destabilization can facilitate the recogni- 
tion step that leads to the excision of these lesions (18-20). 
It is generally accepted that the recognition step is 
rate-determining in eukaryotic NER (7,14,66,67), and 
our approach of correlating NER efficiencies to the struc- 
tural features of lesions in double-stranded DNA is based 
on this concept, although the impact of downstream 
effects on excision efficiencies cannot be excluded 
(21,80). The local thermodynamic impact of a lesion 
results from a net balance between lesion-induced 
destabilizing distortions and stabilizing van der Waals 
interactions between the lesion aromatic ring systems 
and neighboring base pairs in intercalated conformations 
(69,71). The much greater stacking stabilization of the 
dG-C8-AF adduct compared with dG-C8-AAF, 
combined with its less distorting impact on helix untwist- 
ing and minor groove enlargement, explain the signifi- 
cantly lower susceptibility to the nucleotide excision 
repair of the dG-C8-AF adducts. The sequence-dependent 
NER efficiencies on the other hand, stem from differences 
in DNA structural distortions caused by the lesions that 
include untwisting and minor groove enlargement; they 
stem from sequence-governed differences in stacking 
patterns. The present study highlights the important role 
played by lesion structure and base sequence context in 
determining NER excision efficiencies (21,69,82). 
Furthermore, it demonstrates how MD simulations can 
deconvolute certain properties of individual conformers 
in heterogeneous populations, which may be difficult to 
elucidate experimentally. 
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